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(2) RELATED APPEALS AND INTERFERENCE 

There are no related appeals or interferences known to appellant, the appellant's legal 
representative, or assignee which will directly affect or be directly affected by or have a bearing 
on the Board's decision in the pending appeal. 

(3) STATUS OF THE CLAIMS 

During the prosecution of the instant application, claims 1-74 were originally presented 
for examination. Claims 1-15 were cancelled, and Claims 16-51 are withdrawn. Claims 52-74 
remain pending. A copy of all the pending claims is presented in the Appendix. 

(4) STATUS OF AMENDMENTS AFTER FINAL 

The amendment filed January 25, 2005 subsequent to the final rejection was entered and 
deemed to overcome the rejections under §1 12, second paragraph, §102, and §103. See, 
Advisory Action mailed January 25, 2005. The amendment April 25, 2005 subsequent to the 
final rejection was not entered and is not relied upon for the purposes of this Appeal. 

(5) SUMMARY OF THE INVENTION 

Claims 64-74 are directed to methods of identify ing a protein from a library of individual 
proteins that binds to a target of interest. Claims 52-63 are directed to the libraries, per se. The 
claimed method recites certain structural claimed aspects, including "protease sensitive sites" and 
"identifier sequence tracts which are unique" possessed by the individual proteins in the library. 
The identifier sequence tracts can be used to recover ("identify") a protein in the library which 
has the desired binding characteristics after cleavage at the protease sensitive site. See, e.g., 
Claim 65(iv) and Claim 75(iv). Specification, Page 2, lines 25-30; Page 3, lines 23-29; Page 4, 
lines 16-40. The claimed method is generally useful for identifying proteins that possess an 
activity of interest. The term "barcode" is also used in the specification to refer to the "identifier 



2 



MERCK-2309 



sequence tracts." Specification, Page 2 lines 21-29. The terms are used interchangeably in the 
arguments below. 

(6) ISSUES 

There are no prior art rejections. The sole issues remaining in this application are: 

Whether the claims are unpatentable for lack of written description under §112, first 
paragraph and utility under § 101. These grounds are treated together for the purposes of this 
Appeal since the issues are intertwined, and if the §1 12 issue is overcome, the §101 would fall, 
as well. 

(7) GROUPING OF THE CLAIMS 

Claims 52-56 and 58-74 stand or fall together. Claim 57 does not stand or fall apart from 
the others since it recites specific protease cleavages site for enterokinase and Factor Xa that 
have specific structural characteristics. See, specification, Page 3, lines 7-8. 

(8) APPELLANTS 1 ARGUMENTS 

It is alleged by the Patent Office that: "The claims or the specification does not recite for 
any specific structure of the library." See, Office action dated August 25, 2004, Page 4. This is 
not correct. Claim 52 and others clearly recite specific structural features, e.g., "individual 
identifier sequence amino acid tracts which are unique to said individual protein when bound to 
the specific target of interest, and are flanked by one or more protease sensitive sites ..." 
(Underlining added.) In Claims 65 and 74, e.g., these structures (i.e., identifiers and protease 
sensitive sites) are utilized to identify a protein with desired binding characteristics. This useful, 
novel, and unobvious method can be applied to many different kinds of proteins, including 
antibodies (e.g., Specification, Page 1, lines 12) and other binding moieties. A utility of the 
claimed invention involves the use of the identifiers and protease sensitive sites to select proteins 
with certain characteristics. The focus on the particular proteins to which the identifiers and 
protease sensitive sites are joined is misplaced. The same reasoning would preclude any general 
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method from being patented. 

Identifier sequence amino acid tracts (also known as "barcodes") are fully described in 
the specification. The barcodes are amino acid tracts encoded for by oligonucleotides that are 
attached, in open reading frame, to the nucleotide sequence encoding the polypeptide of interest. 
A protease sensitive-site is placed between them to facilitate release of the peptide encoded by 
the barcode sequence. The polypeptide is expressed and then selected on the basis of an activity, 
e.g., a binding activity. Those polypeptides which possess the activity of interest can be then 
subjected to protease cleavage to cause the release of the barcode peptide. The peptide can be 
directly sequenced or analyzed by mass spectrometry to determine its amino acid sequence. This 
information is reversed translated into an oligonucleotide sequence that can be utilized to 
specifically amplify (by polymerase chain reaction) the polypeptide possessing the activity of 
interest. See, e.g., Specification, Page 2, line 25-Page 30, line 30. 

The design of identifier sequence amino acid tracts (i.e., barcodes), including their 
structures, is described throughout the specification. For example, on Page 4, lines 23-44, an 
eight-amino acid barcode sequence using 17 of the 20 natural amino acids is described. A 
specific example is provided of a family of peptide barcodes and their corresponding 
oligonucleotide sequences. See, Page 5, lines 1-35. Example 2, beginning on Page 29 of the 
specification, provides a specific working example of barcode sequence. See, Page 30, lines 5- 
15. Another example is disclosed on Page 35, lines 30-35. Thus, the specification provides 
adequate written description of the barcode sequence, including specific structural examples. 

Protease sensitive sites were well-known in the scientific community at the time the 
application was filed. The specification describes the structures for enterokinase, Factor Xa, and 
thrombin cleavage sites (which are also widely used in molecular biology). See, e.g., Page 3, 
lines 5-10 and Page 23, line 34-Page 24, line 2. A V8 protease cleavage site is also disclosed. 
See, Page 30, lines 5-15. Thus, specific structures of protease-sensitive sites are described in the 
specification. An Appellant is not required to disclose every species encompassed by their 
claims. In re Angstadt, 537 F.2d 498, 502-503, 190 USPQ 214, 218 (CCPA 1976). Other 
protease sites were known in the art. These include, e.g., enterokinase (Hopp et al., 1988), 
thrombin (Eaton et al., 1986; Manoharan et al., 1997), collagenase (Gehring et al., 1995), tobacco 
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etch virus (U.S. Pat. No. 5,532,142), IgA protease (U.S. Pat. No. 5,427,927), and dipeptide- 
specific proteases, such as aminopeptide B and carboxypeptides B, E, and N (U.S. Pat. No. 
5,506,120). See, Exhibit 1 for a copy of the non-patent publications. A patent need not teach, 
and preferably omits, what is well known in the art. In re Buchner, 929 F.2d 660, 661, 18 
USPQ2d 1331, 1332 (Fed. Cir. 1991); Hybritech, Inc. v. Monoclonal Antibodies, Inc., 802 F.2d 
1367, 1384, 231 USPQ 81, 94 (Fed. Cir. 1986), cert denied, 480 U.S. 947 (1987); and 
Lindemann Maschinenfabrik GMBH v. American Hoist & Derrick Co., 730 F.2d 1452, 1463, 221 
USPQ 481, 489 (Fed. Cir. 1984). 

It is not correct that all functional descriptions of genetic material do not meet the written 
description requirements. This was expressly stated in Enzo Biochem Inc. v. Gen-Probe, 63 
USPQ2d 1609, 1613 (Fed. Cir. 2002). As pointed out by the Enzo Court, antibody claims are 
considered to be in compliance with §112, first paragraph, even though no structural features of 
the antibody are disclosed in the specification. Similarly, the identifier and protease sites are 
sufficient to comply with the written description requirements. (Indeed, the present examiner has 
allowed claims reciting "exonuclease sensitive site" which is analogous to the feature recited in 
the pending claims that are now alleged to lack written description. See, U.S. Pat. No. 
6,322,969). 

The examiner's reasoning in relying on Eli Lilly and Fiers on Page 7 of the Office action 
dated August 25, 2004, is neither relevant nor appropriate. In those cases, the Appellant had 
discovered a single sequence for a protein from one mammalian species, but was attempting to 
cover the entire genus of mammalian proteins. This is not the case here. Appellant is claiming a 
general method of identifying proteins, and a library that is useful in this claimed method, not a 
specific protein as in the Eli Lilly and Fiers cases. Appellant is not claiming beyond what is 
described and enabled in the specification, i.e., the general method and use of a library having 
certain structural features that make it useful in the method. 

Claims of such type and scope have been granted by the Patent Office (e.g., System to 
detect protein-protein interactions, U.S. PatNos. 5,283,173; 5,468,617; and 5,667,973). There is 
no statutory reason to preclude it. 
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The allegation on Page 5 of the Office action that " ... there was no immediately apparent 
[sic] or 'real world' utility as of the filing date" is preposterous. Not only does the specification 
provide a number of uses of the claimed library and methods, but beginning on Page 26 of the 
Specification, several actual and working examples are described in which proteins with a 
binding affinity to a target were identified. This clearly is a substantial, specific, and credible 
utility. 

(9) Conclusion 

In view of the arguments and authorities presented above, Appellants request that the 
Examiner's action in making and maintaining the rejection under 35 USC §101 and §112, first 
paragraph, be reversed and that the application be allowed. 



Respectfully submitted, 




RichartJ M. Lebovitz (Reg. No. 37,067) 
Attorney for Appellant(s) 

MILLEN, WHITE, ZELANO & BRANIGAN, P.C. 

Arlington Courthouse Plaza I, Suite 1400 

2200 Clarendon Boulevard 

Arlington, Virginia 22201 

(703) 812-5317 [Direct Dial] 

(703) 243-6410 [Facsimile] 

Internet Address: lebovitz@mwzb.com 



Filed: June 24, 2005 
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Claim 16 (Withdrawn) A method of screening a protein library comprising screening said 
library for one or more desired properties, followed by dereplication to identify one or more 
individual proteins in the library having the desired property. 

Claim 17 (Withdrawn) A method as claimed in claim 16 wherein the library is screened 
for binding to a target moiety. 

Claim 18 (Withdrawn) A method as claimed in claim 17 wherein binding is detected by 
mass spectrometry, particularly matrix-assisted laser desorption/ionization time-of-flight 
(MALDI-ToF) spectrometry. 
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Claim 19 (Withdrawn) A method as claimed in claim 16 wherein the library is screened 
for a specific biological activity. 

Claim 20 (Withdrawn) A method as claimed in claim 17 wherein the target is a complex 
mixture, eg a mixture of molecules, whole cells or cell membranes. 

Claim 21 (Withdrawn) A method of protein identification and/or sequencing comprising 
providing a library of individual proteins, one or more of which may bind to a target of interest, 
wherein each individual protein, together with its gene, is bound to an "associating moiety". 

Claim 22 (Withdrawn) A method as claimed in claim 21 wherein the library of proteins is 
brought into contact with the target of interest either before or after the "associating moiety". 

Claim 23 (Withdrawn) A method as claimed in claim 21 wherein after screening for 
binding to the target the library is dereplicated to identify one or more proteins with a desirable 
property, proteins which bind to the target. 

Claim 24 (Withdrawn) A method as claimed in claim 21 where the "associating moiety" is 
a particle. 

Claim 25 (Withdrawn) A method as claimed in claim 24 wherein the particle is a latex 
bead. 

Claim 26 (Withdrawm) A method as claimed in claim 21 wherein the "associating moiety" 
is a protein or protein complex. 

Claim 27 (Withdrawn) A method as claimed in claim 26 wherein the "associating moiety" 
is avidin or steptavidin and each of the proteins in the library and their associated genes are 
biotinylated. 
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Claim 28 (Withdrawn) A method as claimed in claim 21 wherein the Dassociating moietyD 
is a bispecific binding molecule capable of binding to both the proteins and genes. 

Claim 29 (Withdrawn) A method as claimed in claim 21 wherein the Dassociating moietyD 
is a living cell or cellular virus such as a bacteria or bacteriophage. 

Claim 30 (Withdrawn) A method as claimed in claim 21 wherein one or other molecules 
which alter the properties of the proteins in the library are bound to the Dassociating moietyDPt 

Claim 31 (Withdrawn) A method as claimed in claim 21 wherein the genes encoding the 
proteins in the library are are attached to the Dassociating moietyD^rior to synthesis of the 
individual proteins. 

Claim 32 (Withdrawn) A method as claimed in claim 21 wherein the library of proteins is 
a library of antibody proteins, eg a library of antibody domains such as Fvs. 

Claim 33 (Withdrawn) A method of protein identification and/or sequencing comprising 
providing a library of individual proteins, one or more of which may bind to a target of interest, 
wherein each individual protein is attached to an individual "coding moiety". 

Claim 34 (Withdrawn) A method as claimed in claim 33 wherein the "coding moieties" 
are particles with unique identifier "codes". 

Claim 35 (Withdrawn) A method as claimed in claim 34 wherein the "codes" are different 
ratios of measurable signal, eg fluorescent, chemiluminescent or radioactive labels, or a physical 
feature such as a unique marking. 
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V 

Claim 36 (Withdrawn) A method for analyzing mixtures of proteins comprising: 

(iii) digestion or cleavage of the protein mixture; 

(iv) fractionation of the resultant peptides; and 

(v) analysis of the resultant peptides by means of their mass and/or sequence. 

Claim 37 (Withdrawn) A method as claimed in claim 36 wherein the fractionation in step 
(ii) is carried out using a library of protein binding agents. 

Claim 38 (Withdrawn) A method as claimed in claim 36 wherein the resultant peptides are 
subjected to physical fractionation and/or chemical tagging as part of the fractionation step. 

Claim 39 (Withdrawn) A method as claimed in claim 36 wherein the resultant peptides are 
subjected to addition of one or more amino acids as part of the fractionation step. 

Claim 40 (Withdrawn) A method as claimed in claim 37 wherein the library of protein 
binding agents is a library of antibodies or antibody fragments. 

Claim 41 (Withdrawn) A method as claimed in claim 37 wherein the protein binding 
agents are major histocompatibility proteins, T cell receptors and natural proteins or protein 
domains involved in protein-protein binding interactions, such as SHI domains. 

Claim 42 (Withdrawn) A method as claimed in claim 40 wherein the library of protein 
binding agents is pre-selected for binding to one or more proteins or peptides derived from the 
protein mixture or a related protein mixture under analysis. 

Claim 43 (Withdrawn) A method as claimed in claim 42 wherein the protein mixture is 
derived from a normalised recombinant gene library. 
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Claim 44 (Withdrawn) A method as claimed in claim 36 wherein the protein mixture is 
initially bound to a solid phase prior to digestion or cleavage either via the N or C-terminus or 
via specific amino acids or via specific sequences of amino acids. 

Claim 45 (Withdrawn) A method as claimed in claim 36 wherein specific amino acids or 
modified amino acids found in the proteins are derivatised prior to binding to a solid phase, such 
binding occurring either before or after digestion or cleavage of the protein mixtures. 

Claim 46 (Withdrawn) A method as claimed in claim 45 wherein the specific, or modified 
amino acids are derivatised with biotin prior to binding to avidin or streptavidin. 

Claim 47 (Withdrawn) A method as claimed in claim 45 wherein specific, or modified, 
amino acids are derivatised with ligands prior to binding to ligand-specific affinity reagents. 

Claim 48 (Withdrawn) A method as claimed in claim 36 wherein specific naturally 
modified amino acids found in the proteins are bound to a solid phase using modification specific 
affinity reagents, such binding occurring either before or after digestion or cleavage of the protein 
mixtures. 

Claim 49 (Withdrawn) A method as claimed in claim 45 wherein more than one cycle of 
digestion/cleavage and derivatisation is carried out. 

Claim 50 (Withdrawn) A method as claimed in claim 49 wherein mass analysis is carried 
out after each cycle of digestion or cleavage. 

Claim 51 (Withdrawn) A method as claimed in claim 36 wherein peptides released after 
digestion/cleavage are fractionated using physical methods such as HPLC before or after 
fractionation using protein binding agents. 
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Claim 52 (Previously Presented) A library of individual proteins, one or more of which 
being able to bind to a target of interest, each of said proteins comprising within its amino acid 
sequence, or terminal to it, one or more individual identifier sequence amino acid tracts which 
are unique to said individual protein when bound to the specific target of interest, and are flanked 
by one or more protease sensitive sites, said library thus providing proteins comprising a 
diversity of said individual identifier sequence tracts. 

Claim 53 (Previously Presented) A library of claim 52, wherein said identifier sequence 
tracts have been generated randomly or semi-randomly. 

Claim 54 (Previously Presented) A library of claim 52, wherein an individual protein of 
said library comprises multiple identifier sequence tracts. 

Claim 55 (Previously Presented) A library according claim 54, wherein an individual 
protein comprises two adjacent identifier sequence tracts. 

Claim 56 (Previously Presented) A library according to claim 52, wherein said protease 
sensitive site is a recognition site for endoprotease digestion. 

Claim 57 (Previously Presented) A library of claim 56, wherein the recognition site is the 
site for enterokinase or Factor Xa. 

Claim 58 (Previously Presented) A library according to claim 52, wherein said target of 
interest is a protein. 

Claim 59 (Previously Presented) A library of claim 52, wherein the individual proteins of 
the library are antibodies or recombinant antibody domains, which comprise antibody variable 
regions. 
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Claim 60 (Previously Presented) A library of claim 59, wherein the antibody domain is an 
Fv domain consisting of a VH and a VL chain. 

Claim 61 (Previously Presented) A library according to claim 60, wherein each chain has 
its own identifier sequence tract. 

Claim 62 (Previously Presented) A library according to claim 59, wherein said identifier 
sequence tract is C-terminal to the Fv domain sequence. 

Claim 63 (Previously Presented) A library according to 59, wherein the target of interest 
is an antigen. 

Claim 64 (Previously Presented) A method of identifying a protein which binds to a target 
of interest, wherein the protein is a member of the library of claim 52, comprising: 

(i) bringing each of the individual proteins of the library comprising said one or more individual 
identifier sequence tracts and said protease sensitive site(s) in contact with one or more of said 
targets of interest under conditions for an individual protein of said library to bind to said target 
of interest to form a complex, 

(ii) isolating the complex formed by the individual protein and the target of interest, 

Claim 65 (Previously Presented) A method of claim 64, further comprising: 

(iii) digesting said complex to cleave said protease sensitive sites releasing said individual 
identifier sequence tract(s), and 

(iv) determining said released sequence tract(s) by which said individual protein is finally 
identifiable. 

Claim 66 (Previously Presented) A method of claim 64, wherein the individual protein 
has been encoded by a DNA construct comprising nucleic sequences coding for said one or more 
individual identifier sequence tracts and said protease sensitive site(s). 
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Claim 67 (Previously Presented) A method of claim 66, wherein said DNA construct is 
part of a vector system. 

Claim 68 (Previously Presented) A method according to claim 64, wherein the individual 
protein and target of interest bind in solution. 

Claim 69 (Previously Presented) A method according to claim 64, wherein the complex of 
protein / target is isolated by removal of non-bound molecules. 

Claim 70 (Previously Presented) A method according to claim 65, wherein digestion of 
said complex is achieved by endoprotease. 

Claim 71 (Previously Presented) A method according to claim 65, wherein determination 
of the released identifier sequence tract(s) is achieved by mass spectrometry. 

Claim 72 (Previously Presented) A method of claim 71, wherein the mass spectrometry is 
MALDI-ToF. 

Claim 73 (Previously Presented) A method according to claim 64, wherein the individual 
proteins of the library are antibodies or antibody domains having a variable region and the targets 
of interest are antigens. 

Claim 74 (Previously Presented) A method of identifying a protein which binds to a target 
of interest wherein the protein is a member of the library as defined in claim 52, comprising the 
following steps: 

(i) bringing each of the individual proteins of the library comprising said one or more individual 
identifier sequence tracts and said protease sensitive site(s) in contact or association with one or 
more of said targets of interest, 

(ii) isolating the complex formed by the individual protein and the target of interest, 
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(iii) digesting said complex to cleave the introduced protease sensitive sites releasing said 
individual identifier sequence tract(s), and 

(iv) determining said released sequence tract(s) and using such sequence information to recover 
the individual protein library member from the library, and 

(v) including one or more further rounds of screening and enrichment for a protein which binds 
to the target of interest. 
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Proteolytic Processing of Human Factor VIII. Correlation of Specific Cleavages 
by Thrombin, Factor Xa, and Activated Protein C with Activation and Inactivation 

of Factor VIII Coagulant Activity 

Dan ijil^pfc Henry Rodriguez, and Gordon A. Vehar* 
Department of Protein Biochemistry, Cenentech, Inc., South San Francisco, California 94080 

Received June II, 1985 

abstract: Human factor VIII was isolated from commercial factor VIII concentrates and found to consist 
of multiple polypeptides with molecular weights ranging from 80000 to 210000. Immunological and amino 
acid sequence data identified these polypeptides as subunits of factor VIII. N-Terminal amino acid sequence 
analysis determined that the M r 210000 and 80000 proteins are derived from the N- and C-terminal portions 
of factor VIII, respectively; M r 90000-180000 polypeptides are derived from the M t 210000 polypeptide 
by C-terminal cleavages. Treatment of purified factor VIII with thrombin resulted in proteolysis of M x 
80 000-210000 proteins and the generation of polypeptides of M T 73 000, 50 000, and 43 000. Maximum 
coagulant activity of thrombin-activated factor VIII was correlated with the generation of these polypeptides. 
The proteolysis as well as activation of factor VIII by thrombin was found to be markedly dependent on 
CaCl 2 concentration. Proteolysis of factor VIII with activated protein C (APC) resulted in degradation 
of the M f 90000-210000 proteins with the generation of an M t 45 000 fragment. This cleavage correlated 
with inactivation of factor VIII by APC. The M T 80000 protein was not degraded by APC. Factor Xa 
cleaved the M T 80000-210000 factor VIII proteins, resulting in the generation of fragments of M r 73 000, 
67 000, 50000, 45 000, and 43 000. Factor Xa was found to initially activate and subsequently inactivate 
factor VIII. Activation by factor Xa correlated with the generation of M r 73 000, 50000, and 43 000 
polypeptides while inactivation correlated with the cleavage of M x 73 000 and 50000 polypeptides to fragments 
of M t 67 000 and 45 000, respectively. The cleavage sites in factor VIII of thrombin, factor Xa, and APC 
were identified by amino acid sequencing of the fragments generated after cleavage of factor VIII by these 
proteases. Interestingly, factor Xa was found to cleave factor VIII at the same sites as APC and thrombin. 
This may explain why factor Xa activates as well as inactivates factor VI II, 



I^irification of factor VIII (antihemophilic factor) from 
plasma indicates that its coagulant activity is associated with 
multiple polypeptide chains having molecular weights ranging 
from 80000 to 210000 (Vehar & Davie, 1980; Fass et al. f 
1982; Fulcher & Zimmerman, 1982; Rotblat et al., 1985). 
Recently, cDNA clones encoding the entire factor VIII protein 
sequence have been obtained (Toole et ah, 1984; Wood et al. f 
1984). The amino acid sequence deduced from such clones 
predict* a mature single-chain protein (2332 amino acids) 
having a molecular weight of — 300000 (Wood et al., 1984; 
Toole et al., 1984). Sequence data obtained from the protein 
chains of purified factor VIII preparations have been shown 
to overlap with the sequence predicted from the cDNA clones 
(Toole et al., 1984; Vehar et al., 1984), and the purification 
of a single-chain precursor having a M t > 300 000 has been 
reported (Rotblat et al., 1985). Thus, if factor VIII circulates 
in plasma as a single-chain form, it is partially degraded during 



its purification, yielding a form with multiple polypeptide 
chains. 

Amino acid sequence analyses also revealed the orientation 
of the protein chains associated with factor VIII to the sin- 
gle-chain precursor deduced from the cDNA sequence (Vehar 
et al., 1984; Toole et al., 1984). Such data show that the M t 
210000 and 80 000 proteins represent the N-termina! and 
C-terminal portions of factor VIII, respectively (Vehar et al., 
1984; Toole et ah, 1984). It is proposed that several proteolytic 
cleavages on the C-terminal side of the M, 210000 protein 
generate a series of proteins with molecular weights between 
90 000 and 180000 (Vehar et al., 1984; Toole et al., 1984). 

Recently, thrombin activation of factor Vin coagulant 
activity has been shown to be associated with specific pro- 
teolysis of factor VIII protein chains (Vehar & Davie, 1980; 
Faas et al., 1982; Fulcher et al., 1983, 1984; Loller et al., 1984; 
Rotblat et al., 1985). During thrombin activation of purified 
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human factor VIII. proteins with M f 1 10000-210000 appear 
to be proteolyzed to generate an M r 90000 protein, while the 
M t 80000 protein is cleaved to an M t 73000 fragment (Fulcher 
et al., 1983). Fulcher et al. (1983) suggest that the M t 90000 
and 73 000 proteins are the active subunits of thrombin-ac- 
tivated factor VIII and that cleavage of the M t 90000 protein 
by thrombin (yielding fragments of A/, -50000 and -43000) 
inactivates factor VIII coagulant activity. In contrast, studies 
with purified porcine factor VIII suggest that cleavage of an 
M f 82000 protein, which is analogous to the M T 90000 moiety 
of human factor VIII, results in further activation of factor 
VIII coagulant activity (Fass et al., 1982; LoUer et al., 1984). 
Other than species differences, the reason(s) for this dis- 
crepancy is (are) unknown. 

The inactivation of human factor VIII by activated protein 
C (APC), a vitamin K dependent plasma protease, has also 
been correlated with limited proteolysis of the factor VIII 
protein (Fulcher et al., 1984). Cleavage of factor VIII with 
APC results in the proteolysis of the M t 90000-210000 
proteins with the concomitant appearance of an M t 45000 
fragment (Fulcher et al.. 1984). 

The above-mentioned studies clearly show that specific 
proteolytic processing of factor VIII regulates factor VIII 
coagulant activity. In this report, we compare the effects of 
thrombin, factor Xa [also known to activate factor VIII (Vehar 
& Davie, 1980; Davie et al., 1975)], and APC on factor VIII 
coagulant activity and correlate the changes in activity with 
changes in factor VIII subunit structure. Furthermore, most 
of the cleavage sites of these proteases have been identified 
by amino acid sequence analyses of the fragments generated 
by the proteolysis of factor VIII. Knowledge of these sites not 
only allows the cleavage patterns of these proteases to be 
compared but also begins to illustrate the basis of the mech- 
anisms that alter factor VIII coagulant activity. 

Materials and Methods 

Human factor Xa, human activated protein C (APC), and 
human a-thrombin were all generous gifts from Dr. Walter 
Kisiel (The University of New Mexico). Affi-gel 10 was from 
Bio-Rad; rabbit brain cephalin and phcnylmethanesulfonyl 
fluoride (PMSF) were from Sigma Chemical Co.; Platelin was 
obtained from General Diagnostics; factor VIII deficient and 
normal human plasmas were from George King Biomedical; 
factor VIII chromogenic Coatest assay was from Helena. 
Bio-Gel A- 15m void volume fractions enriched in factor 
VHI/von Willebrand factor (vWF) complexes were prepared 
from commercial concentrates and were a generous gift of 
Cutter Laboratories and Dr. D. Schroeder. 

Purification of Human Factor VIII. Commercial factor 
VIII concentrate from Cutter Laboratories was resolved on 
a Bio-Gel A-l5m column as described by Fay et al. (1982). 
The V 0 fraction containing factor VIII coagulant activity was 
made 1 mM PMSF and 35 mM 0-mercaptoethanol. This 
results in the reduction of von Willebrand factor (vWF)/ factor 
VIII complexes, which has been shown to cause their disso- 
ciation without significantly affecting factor VIII coagulant 
activity (Vehar Sc Davie, 1980; Savidge et al., 1979). Also, 
the functional and structural properties of the factor VIII 
preparations isolated here are very similar to factor VIII 
preparations isolated by others in the absence of reducing 
agents (Fulcher & Zimmerman, 1982; Rotblat et al.. 1985). 
The reduced V 0 was batch-separated with DEAE-Scpharose 
that had been equilibrated in a 0.02 M imidazole. pH 6.9, 
bufTer containing 0.15 M NaCl, 0.01 M CaCl„ 0.02 M glycine 
hydrochloride ethyl ester, 5% glycerol, and 1 mM PMSF (VIII 
buffer). Twenty milliliters of DEAE-Sepharose was added 
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for every liter of V 0 fraction. After being stirred for 2-3 h 
at 4 °C, the resin was poured into a column and washed with 
5 column volumes of VIII bufTer. Factor VIII was step-eluted 
with VIII buffer containing 0.1 1 M CaCl 2 . A factor VIII 
monoclonal antibody column was prepared by coupling 10 mg 
of factor VIII monoclonal antibody to 2 mL of Affi-gel 10 
(Wood et al M 1984). The resulting column was equilibrated 
in 0.05 M imidazole, pH 6.9, bufTer containing 0.15 M NaCl, 
0.01 M CaCl 2 , 5% glycerol, and 1 mM PMSF. The factor 
VIII DEAE pool was applied to the antibody column, and the 
column was washed with 50 column volumes of the above 
buffer. Factor VIII was eluted with the same buffer containing 
1.0 M KI. Fractions containing factor VIII activity were 
pooled and dialyzed against 0.05 M tris(hydroxymethyl)- 
aminomethane (Tris), pH 7.5, 0.15 M NaCl, 2.5 mM CaCl 2 , 
5% glycerol, and 1 mM PMSF and stored at -70 °C. Factor 
VIII activity was measured either by coagulation analysis or 
by the factor VIII chromogenic Coatest assay as described by 
Wood et al. (1984). Protein concentration was determined 
by the method of Bradford (1976). 

Cleavage of Factor VIII by Thrombin, Factor Xa, and APC. 
For N-terminal amino acid sequence analysis, approximately 
0.5-1.0 mg of factor VIII was incubated with either thrombin, 
factor Xa, or APC at a 1/50 ratio (w/w). In the case of factor 
Xa and APC, 7i 0 th sample volume of rabbit brain cephalin 
was included in the reaction as a source of phospholipid. After 
1-2 h at 37 °C, the reaction was stopped by adding sodium 
dodecyl sulfate (SDS) to 0.4% and immediately heating the 
samples to 80 °C. Proteolyzed factor VIII was subsequently 
resolved on 5-10% polyacrylamide gradient gels in the presence 
of SDS [SDS-polyacrylamide gel electrophoresis (PAGE)]. 
Electrophoresis was carried out according to the method of 
Laemmli (1970). After staining with Coomassie blue, factor 
VIII peptides were excised and electroeluted according to the 
method of Hunkapiller et al. (1983). Gel-eluted peptides were 
subjected to N-terminal amino acid sequence analysis using 
an Applied Biosystems gas phase sequenator (Hewick et al., 
1982) modified for on-line phenylthiohydantoin identification 
(H. Rodriguez, unpublished results). 

For subunit and activity analysis during proteolysis, aliquots 
of factor VIII (1 10 Mg/mL, 400-700 units/mL) in 0.05 M 
Tris, pH 7.5, 0.15 M NaCl, 2.5 mM CaCl 2 , and 5% glycerol 
were incubated with either thrombin (1.5 Mg/mL), factor Xa 
(2 Mg/mL) or APC (4 Mg/mL) for CM 20 min (thrombin, Xa) 
or 0-30 min (APC). Rabbit brain cephalin 0/io lh sample 
volume) was added in reactions containing APC or factor Xa. 
At the end of each time point, a 10-mL aliquot of the reaction 
was removed, diluted into 0.05 M Tris, pH 7.3, containing 
0.2% bovine serum albumin (BSA), and assayed by coagula- 
tion analysis. To the remainder of the aliquot was added SDS 
to 0.5%, and the sample was immediately heated to 80 °C. 
Proteolyzed factor VIII was subsequently resolved on 6-12% 
SDS-polyacrylamide gels. Proteins were visualized by silver 
staining (Morrissey, 1981). 

Results 

Purification of Factor VIII. Factor VIII was purified by 
initially resolving plasma concentrates on a Bio-Gel A- 15m 
column as previously described (Fay et al., 1982). This allowed 
the partial purification of von Willebrand factor (vWF)/fector 
VIII complexes which elute in the void volume. These com- 
plexes were dissociated by reduction with 0-mercaptoethanol 
and resolved by chromatography on DEAE-Sepharose. Factor 
VIII obtained from the DEAE-Sepharose chroma tograph was 
subsequently purified to homogeneity by using a factor VIII 
monoclonal antibody column. A typical purification suiting 
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peptides may generate the M, 45 000 fragment (Figure 3). The 
JJtSoo and 67000 polypeptides <jcr,ve<l by cleavage 
of the A/ 80000 protein at positions 1689 and 1721. respec- 
Jilfy (Hgure3). The Af, 67 000 polypeptide also appears a. 
ft doublet on SDS-PAGE. r - 

cTeavage of factor VIII by APC results in proteolysa oiM, 
90000-210000 proteins with the appearance of an M, 45 wu 
ra^nt (FigurTi). The M, WW^^g^f 
Kv APC These result* are aimilar lo those or Fulchcr ei ai. 
O^ tIc "tcrrnina. sequence of the M 45 000 foment 
was found to be the same as the N-ter«una. * 
M, 90000-210000 protein. (Figure ^ Therefore. thu 
fragment is derived from the N-tcrminal of factor V 111. J ne 
APC cleaves factor VIII to generate th» ta.nj.rt 
has not been determined. This cleavage, however. »PP^ 
be the same cleavage made by factor Xa that 
an hi 45000 fragment from the N-term.nal of facto VHI. 
Qeavage «Uhi. site by factor Xa is not due to eo.um.net.ng 
ScK anUbodics against APC didnot M.bttbe»r Xa 
from making this deavage (data not shown). By SDS-PAGE, 
wTc^dWt reproducibly detect the C-termma poruons o 
the M, 90000-210000 proteins after proteolysis of facua V1H 
by APC. In F.gure 6. polypeptides with molecular we.gbts 



of -47 000 ~49000. and 67000 are apparent after APC 
deavage -fhe appearance of these polypeptides, however wa, 
only transitory. Also, only one sequence was observed when 
the M 45000 fragment was sequenced. 

AalvoiZn of Factor V1U by Thrombin. A t.me course 
treatmem or factor VIII with catalytic amounts of thrombin 
rcTuSn a 36-fold increase in factor VIII coagulant act,v.ty 
(Figure 4). After maximum activity was reached, throm- 
bin aclL ed factor VIII appeared to remain stable for attest 
I h at 37 »C. F.gure 4 does show a slight decrease .n activity 
it the 2-h timeVint; however, in other <M*"menU. 
decrease was not seen. Factor VIII that was not acuvat^ with 
Sotn remained stable throughout the M. t.rr* cow* (data 
«T^«wn^ Analysis of factor VIII subunit structure during 
Z nZt^^L that factor VII, c^gujant actWUy 
dramatically increases with the genera t.on of '"8™""° £ 
73000 50000. and 43000 (Figure 5). Thrombm therefore, 
to activate factor VIII by initially cleav.ng M, 
&2%S) p-teins to generate the M «0«» Pj-J 
which is subsequently cleaved to po 1ypept.de. of M 50000 
and 43 000. Occurring concomitantly «^ va 8 c °[ ^ 
80000 protein to an M, 73000 polypepl.de. The* results 
Wicate Aat fuUy activated factor VIII may consul of subun.ls 
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fkiure 4: Activation of factor VIII by thrombin and the effect of 
CaCij on thrombin activation. Factor VIII (1 10 Mg/mL. ~700 
uniu/mL) was incubated for the times shown at 37 *C with thrombin 
(1.5 Mg/mL) in the presence of 2.5 (•), 10 <0), or 50 mM CaClj 
(A). At each time point, factor VIII coagulant activity was determined, 
and the reactioo was stopped by the addition of SDS as described 
under Materials and Methods. 

of M r 73000, 50000, and 43000, with the Utter two origi- 
nating from the N- terminal portion and the M t 73 000 from 
the C-tcrminal portion of the factor VIII precursor protein. 

As shown in Figure 4, increasing the CaCl 2 concentration 
significantly alters the extent to which factor Vin is activated 
by thrombin. At 10 mM CaCl* only a 13-fold activation is 
observed, while at 50 mM CaCI 2 factor VIII is not activated 
by thrombin (Figure 5). These results are similar to the recent 
findings of Hultin (1985), who has shown that activation of 
partially pure factor VIII by thrombin in inhibited by CaCl 2 . 
Comparison of the subunit structure of factor VIII activated 
by thrombin at 2.5, 10, and 50 mM CaCI 3 shows that at the 
higher CaCI 2 concentrations (10 and 50 mM) the proteolytic 
processing of factor VIII by thrombin is limited (Figure 5). 
At 10 mM CaClj, the M r 90000 and 80000 polypeptides are 
only partially cleaved to the A/ r 73000, 50 000, and 43000 
subunits, while at 50 mM CaClj factor VIII is not cleaved at 
all by thrombin (Figure 5). Interestingly, at 10 mM CaCl 2 , 
after maximum activity is achieved there appears to be very 
little change in factor VIII subunit structure yet coagulant 
activity greatly diminishes (Figures 4 and 5). The reasons for 
this decrease in activity are presently unknown. Here we show 
that maximum activation and proteolytic processing of factor 
VIII by thrombin appear to occur at physiological CaCl 2 
concentration (2.5 mM), while partial activation or proteolysis 
occurs at higher CaCl 2 concentrations (Figures 4 and 5). In 
the absence of CaClj, factor VIII wai proteolyzed and acti- 
vated by thrombin similarly to factor VIII treated by thrombin 
in the presence of 2.5 mM CaCl 2 (data not shown). 
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Inactivation of Factor VIII by APC. Treatment of factor 
VIII with APC resulted in a dramatic decrease in coagulant 
activity (Figure 6). Correlated with the decrease in activity 
is the cleavage of Af r 90000-210000 proteins with the con- 
comitant generation of an Af r 45000 polypeptide (Figure 6, 
inset). The M, 80000 protein is not significantly proteolyzed. 
As previously discussed, the N -terminal amino acid sequence 
of the M T 45000 polypeptide is identical with the N-tcrminaJ 
sequence of the M t 90000-210000 proteins (Figure 3), and 
the exact site at which this cleavage occurs is presently un- 
known. However, as proposed above for factor Xa, APC may 
cleave at position 336. This site precedes a very acidic region 
(15 Asp-Glu; 4 Lys/Arg; total of 42 amino acids) of the M r 
90000-210000 proteins. Significantly, cleavage of the M t 
90000 protein at position 372, immediately following this 
acidic region, generates the AS, 50000 and 43 000 subunits of 
thrombin-activated factor VIII. Taken together, this suggests 
that the acidic region between positions 336 and 372, shown 
in Figure 3, u of functional importance. 

Activation of Factor VIII by Factor Xa. Similar to 
thrombin, factor Xa cleaves factor VIII at position 372 of the 
M r 90000 protein and at position 1689 of the M r 80000 protein 
(Figures 2 and 3). These cleavages would result in activation 
of factor VIII, as is the case for thrombin. However, factor 
Xa also appears to cleave factor VIII at the same site that APC 
proteolyzes factor VIII (Figure 2 and 3). This cleavage at 
position 336 would inactivate factor VIII. These results 
suggest that factor Xa would at best only moderately activate 
factor VIII and ultimately cause inactivation. Indeed, over 
a 2-h time course, factor Xa initially activated factor VIII only 
3-fold and eventually inactivated factor VIII (Figure 7a). 

During the time course treatment of factor VIII with factor 
Xa, the A/, 1 10000-210000 proteins were initially cleaved, 
with the major product being the M r 90 000 protein (Figure 
7b), This protein was cleaved to generate M r 50000, 45000, 
and 43 000 polypeptides (Figure 7b). Subsequently, the M T 
50 000 polypeptide appeared to be cleaved to the M r 45 000 
fragment This proteolysis correlates with the inactivation of 
factor VIII by factor Xa (Figure 7). Occurring concomitantly 
with cleavage of the M t 90000 protein is the cleavage of the 
M t 80000 protein to the Af T 73 000 polypeptide, which is 
subsequently cleaved to generate an A/, 67000 polypeptide. 
This latter cleavage also correlates with factor VIII inactivation 
(Figure 7). Whether this cleavage, itself, is sufficient to in- 
activate factor VIII has not been determined. 

Discussion 

Recently a detailed understanding of the primary structure 
of factor VIII was made possible due to the isolation of factor 
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VI II cDNA and genomic clones (Wood et al, 1984; Took et 
al., 1984; Giucbier et al, 1984). The deduced amino acid 
fcquence predicu a mature tingle-chain protein consisting of 
2332 amino acids which, after accounting for 25 potential 
N -linked glycosylation aites, indicate* that the lirigle-chain 
form of factor VIII has an U, > 300 000. This is supported 
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by the purification of single-chain factor VIII having M, 
> 300 000 from plasma (RotbUt et al.. 1985). The single-chain 
precursor form of factor VIII appears to be readily proteo!y»d 
in vivo and/or in vitro, yielding a species consisting of multiple 
subunits with M, 80000-210000. As stated previously, the 
protease which cleaves at position 1648 to generate the M, 
80000 protein is unknown as are the protease(s) and sites 
which generate the M t 1 10000-180000 proteins seen when 
factor VIII is resolved by SDS-PAGE (Figures 1 and 8). 

Detailed analysis of the factor VIII sequence revealed a 
triplicated domain structure. These domains each consiilof 
approximately 330 amino acids and are approximately 3D* 
homologous (Vehar et al, 1984). mterestingly. these domains 
also share approximately 30% homology with the triplicated 
domains of the plasma copper binding protein cerutoplaamin 
(Vehar et al., 1984). The importance of this homology as It 
pertains to factor VIII function is as yet not understood. The 
location of these domains within the factor VIII precursor ts 
shown in Figure 8. 

Thrombin activation of purified factor VIII correteteswith 
proteolysis at positions 740. 372, and 1689 (Figure! i)T These 
cleavages ultimately generate the M, 73000. 50000. and 
43000 subunits. Cleavage at position 740 removes the C- 
terminal region of the M t 1 10000-210000 proteins, generating 
the M, 90000 protein (Toole et al., 1984). Subsequently, the 
M 90000 protein is cleaved at position 372 to generate the 
M 50000 and 43000 subunits. This cleavage site is between 
two cerulopUsnun-like domains and follows an acidic spacer 
region (336-372) (Figure* 3 and 8). Cleavage of the 
80000 protein at position 1689 to generate the AT, 73 000 
subunit also follows an acidic region (positions 1649-168*) 
of factor VIII, which has tome sequence homology with the 
region between positions 336 and 372 (Vehar et al., 1984). 

Similar to results presented here, porcine factor VIII has 
been shown to be activated 70-fold by thrombin in the pr«fn« 
of 5 mM CaCl, (Fa** et al., 1982; Loller et al I.. 1984). This 
activation was correlated with the cleavage of an M, 82000 
protein to M, 44000 and 35000 polypeptide* and the cleavage 
of an M, 76 000 protein to a fragment of M, 69 000. The M, 
82000. 44000. and 35000 proteins of porcine factor VIII are 
analogous to the M, 90000, 50000, and 43000 
human factor VIII, while the M, 76000 ^ ^ poly- 
peptides of porcine factor Vlll are analogous to the M, 80000 
and 73 000 proteins or human factor VIII (Fa*s et al.. 1985). 
Interestingly, thrombin-activated porcine factor VIII was found 
to be unstable even though its subunit structure remained 
unchanged (Loller et al.. 1984). However, it could be sta- 
bilized by factor DC* and phospholipid, suggesting that changes 
other than proteolysis may cause inactivation of factor VIII 
coagulant activity (LoDer et al, 1984). This is cotnrtenl ; with 
the results of Hultin Sl Jetty (1981) and Rick & Hoyer 
(1977) who observed that inactivation of thTOmbin-acirvatec 
factor Vni was not mediated by active thrombin. Factor VIII 
activated by thrombin at 10 mM CaCI, becomes Inactive writ 
time even though there is little change in subunit structure 
after activation (Figure 5); it is therefore possible that 
thrombin-activated factor VIII may not be stable at noo- 
physiological high CaCI, concentrations. We show here that 
thrombin-activated factor VIII wu stable for at least 1 h at 
37 -C. This stability is probably a reflection of the high 
concentration (1 10 fig/mL) of factor VIII as well as the Caa 2 
concentration (2.5 mM) used in thrombin activation expen- 
ments. 

In contrast to our results and those of Fas* et al. (1982). 
Fulcher et al. (1983) observed that thrombin activation of 
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factor VIJI correlate* with the generation of M r 90000 and 
73 000 proteins. They suggest that cleavage of the M t 90000 
protein to M f 50000 and 43 000 polypeptides results in inac- 
Uvation of factor VIII. These investigators, however, activated 
factor VIII in the presence of 40 mM CaCl a . As shown here 
and as reported by Hultin (1985), this CaCl 2 concentration 
should limit the activation of factor VIII by thrombin (Figures 
4 and 5). Indeed, these authors only observed a 5-fold acti- 
vation. 

Inactivatlon of factor VIII by APC correlates with cleavage 
of M r 90000-210000 proteins and generation of an M, 45 000 
fragment (Figure 6). This result is similar to those reported 
by Fukher et al. (1984). The site at which APC cleaves has 
yet to be identified; however, as discussed previously, this 
cleavage could occur at arginine-336 (Figures 3 and 8). Since 
thrombin cleavage at position 372 of the M f 90000 protein 
causes activation (Figure 4), the acidic region between posi- 
tions 336 and 372 may be functionally important in regulating 
the activity of factor VIII. Recently, Guinto & Esmon (1984) 
have shown that factor V proteolyzed by APC does not interact 
with prothrombin or with factor Xa. It is tempting to speculate 
that this acidic region of factor VIII may be involved in 
protein /protein or protein /phospholipid interactions. 

Cleavage of factor VIII by factor Xa results in a modest 
activation followed by inactivation of factor VIII coagulant 
activity (Figure 6). This observation is consistent with the fact 
that factor Xa cleaves at the same sites as thrombin (372, 
1689) as well as APC (336). (Figure 8). The inactivation of 
human factor VIII by factor Xa has also been observed by 
Triantapbyllopoulus (1979). However, purified bovine factor 
VIII has also been shown to be activated and stable by factor 
Xa at a nonphysioJogical low pH (Vehar & Davie, 1980). 
Activation of human factor VIII by factor Xa appears to 
correlated with the generation of Af r 73 000, 50000, and 43000 
fragments, while inactivation appears to correlate with the 
cleavage of the Af, 50000 subunit to the M, 45000 polypeptide. 
Presumably, this cleavage occurs at the same site that APC 
cleaves factor VIII (Figure 3). Interestingly, H has been shown 
that inactivation of thrombin-activated porcine factor VIII by 
APC correlates with the cleavage of the M t 44 000 subunit, 
which is analogous to the kf \ 50000 subunit of human factor 
VIII (Fass et al., 1984). These results suggest that the M, 
50000 subunit a indeed a functional subunit of activated factor 
VTJJ. Factor Xa also cleaves the M x 73000 subunit at position 
1722 to generate an hf x 67 000 polypeptide. Whether this 
cleavage alters factor VIII activity has not been determined. 
However, this cleavage occurs within a ce miopias min-like 



domain and may release the first 20 amino acids of the A3 
domain (Figure 8). 

The functional as well as structural similarities of factors 
V and VIII have recently been recognized (Vehar et al. f 1984; 
Fulcher et al., 1983, 1984; Church et al., 1984; Fass et al., 
1985). Both of these proteins function as cofactors in the 
intrinsic coagulation pathway. Factor VIII in a complex with 
factor IXa. calcium ions, and a phospholipid surface functions 
in the activation of factor X. Factor V in a complex with 
factor Xa, calcium ions, and a phospholipid surface functions 
in the conversion of prothrombin to thrombin. Both proteins 
appear to circulate in plasma as large (Af r 300 000) single- 
chain precursors which are proteolytically processed to yield 
active cofactors (Mann et al., 1981; Rotblat et al. 1985). In 
both cases, the N- terminal (A/ r —90000) and C-terminal (A/ r 
—80000) portions represent the functional regions of these 
cofactors (Vehar et al., 1984; Toole et al., 1984; Esmon, 1979; 
Hibbard & Mann, 1980; Nesheim et al., 1984). Separating 
these regions in both factor V and factor VIII is a highly 
glycosylated region (A/ r —100 000). The function of this 
region is as yet undetermined (Vehar et al., 1984; Church et 
al., 1984). Amino acid sequencing also shows that factor V 
has homology with both factor VIII and ceruloplasmin 
(Church et al., 1984; Fass et al.. 1985). Factor V has also 
been shown to contain at least a duplication of the cerulo- 
plasmin-like domain (Fass et al., 1985). Factors VIII and V 
are also both activated by thrombin and factor Xa and inac- 
tivated by APC (Foster el al., 1983; Suzuki et al., 1983). 
Thrombin activation of either factor V or factor VIII results 
in the generation of polypeptides of Af, —90000 (from the 
N-termini) and —70000 (from the C-termini). In the case 
for factor VIII, the Af t 90000 polypeptide appears to be 
proteolyzed further to M t 50 000 and 43000 polypeptides. 
Factor Xa proteolysis of factor V is unlike that by thrombin, 
and factor Xa is less efficient than thrombin in activating 
factor V (Foster et al., 1983). The same appears true for the 
activation of factor VIII by factor Xa (Figure 7). APC ap- 
pears to inactivate both factor VIII and factor V by proteo- 
lyzing the N-terminal portion (molecular weight of approxi- 
mately 90000 subunit) of these factors (Figure 6; Suzuki et 
al., 1983). These data support the notion that factors VIII 
and V are regulated by thrombin, APC, and factor Xa in a 
very similar manner. 

In this study, we have identified the cleavage sites in factor 
VIII of thrombin, factor Xa, and APC. This has allowed us 
to determine how factor VIII is qualitatively processed by these 
proteases. Correlation of the changes in factor VIII subunit 
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structure due to proteolysis by thrombin, factor Xa, or APC 
with changes in factor VIII coagulant activity allowed the 
tentative identification of the functional subunits of activated 
factor VIII. The results presented here suggest that the 
generation of the M t 50000. 43 000, and 73 000 subumts 
correlates with complete activation of factor VIII. 
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A SHORT POLYPEPTIDE MARKER SEQUEN 
USEFUL FOR RECOMBINANT PROTEIN 
IDENTIFICATION AND PURIFICATION 

Thomas T.-BSfg., Kathry* S. Pricket!, Virginia L. Price, RandeU T. Litl 
March, Douglas Pat Cerretti, David L. Urdal and Paul I. Conlon si 
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A small hydrophilic peptide of eight ami- 
no acids (AspTyrLysAspAspAspAspLys) 
was engineered onto the N-terminus of a 
variety of recombinant lymphokines for 
the purpose of aiding in their detection 
and purification from yeast supernatants 
or E. coli extracts. An antibody specific for 
the first four amino acids of this sequence 
was used as a detection reagent and for 
immunoaffinity purification of products 
under mild conditions. Because 'of the 
small size of the peptide ■^Si«ty%tfd^ife : 
hydrophilic nature, the proteins were un- 
affected by its presence and^^n^d^ a 
high level of biological activity/ In addi- 
t^n, it was possible to remove the peptide 
^ enzymatic cleavage procedure us- 
ing eriterokinase. 



mong the many approaches taker) to improve 
the yield and pur;ty of recombinant proteins, 
one particularly useful procedure is to express 
. the desired polypeptide as part of a larger 
fusion protein ' Fusion to a protein-export signal se- 
quence has been used to cause secretion of products fro m 
yeast 11 and £. coli* cells. Furthermore, it has recently been 
recognized that an attached fusion polypeptide sequence 
might serve as an aid to identifying or purifying the 
product. For example, in several cases the added polypep- 
ude -segment contains a complete second protein that 
binds : to affinity columns via its specific substrate or 
ligand. These include (3-gaIactosidase fusion proteins that 
bind to aminophenylthiogalactosidyl Sepharose columns* 
ana. protein A fusion proteins that bind to immunoglob- 
ulin columns*. Such fusion proteins can be highly purified 
in good yields m a single step by passing cell extracts or 
supernatant* over columns of an appropriate affinity 
matrix, then eluting the purified fusion protein by chang- 
ing conditions so that binding is no longer possible A 
related approach is to use an antibody directed against the 
added sequence as a detection or affinity purification 
reagent'- although the high binding" affinity of 
most anusera and monoclonal antibodies often requires 
tne use of denaturing conditions for elurion of the prod- 
uct. r 

The fusion protein approach as several drawbacks that 
nave not been adequately addressed in the past. First 
most fusion protein products fail to fold properly into a 
nauve, active state". It is possible that the added polypep- 
tide segment is responsible for this misfolding due to 



unfavorable interactions during foldi^fl 
protein. This often necessitates treatmW 
denatu rants such as 8M urea and 7M ; :$ 
followed by refolding procedures' ! -«.'-^{^ 
A further problem with fusion proteirislSj 
difficult or impossible to remove the addM§r! 
carbbxyl-terminal sequence from the ^ ffi 
product. One solution has been to use^Jati 
conditions and chemical cleavage agentsYdchj 
70% formic acid 8 -'* or low pH mcubatffitS 
cleavage. However, recent studies have attei 
somewhat milder chemical cleavages su^Mf 
amine treatment at pH 9.0'< or enzymatig'feffl 
dures under physiological conditions. Tn'^br 
factor X; has a proteolytic specificity foW-SSi 
sequence IleGluGIyArg, and has been tfs!ep$ 
..globin^.froma kcll protein fusion seq'ueiJc^.jK 
allowing ^specific cleavage by coIIagcmslijBS 
proposed 6 . Sassenfeld and Brewer' 6 deyefqg^ 
ed ion-exchange, purification tcchniqu^^$P 
proteins to a C-terminal series of argiriiri^f 
are subsequently removed by carboxypepuL, 
ment. These enzymatic processes have beit^fM 
severaJ instances, but often have been 'HiftfiS 
cleavage yields or by unwanted cleavages that$e 
the desired protein sequence 10 . ' 

We decided to create a recombinant proM^L 
and purification system that incorporated &e$cM 
of the above mentioned procedures in order/to^F 
fusion sequence with a combination of the roo$| 
properties. Here we report the development of& 
terminal fusion sequence AspTyrLysAsp AspjfSL 
that we refer to as a marker sequence or "FlagSjM* 
for antibody mediated identification and purxffc 
recombinant proteins. We also describe >a 'WSt 
antibody that reacts with this sequence and caii^bcTi 
an immuno-afHnity purification reagent . tha#g 
marker fusion proteins under very mild concHuon? 
ly, the marker sequence can be removed by trea^ 
the protease, enterokinase, which is specific forth? 
terminal amino acids of the marker sequence' 
treatments arc required at any step in this pr&e'f 
proteins purified by this approach retained their?' 
cal activity throughout the purification, even -wL 
marker sequence was attached. This paper descnL 
expression of several such fusion proteins in Sacchjin 
cerevisiae and Escherichia coli. ' 

RESULTS 

In order to develop this system, we performed a m 
of interrelated steps. The eight amino acid marker J| 
tide was engineered onto the N-terminus of the iymSb 
kme mterleukin 2 (IL-2)" by means of synthetic oligS? 
cleoudes. The fusion protein was expressed in veastC 
the product punfied by conventional means, then us(& 
an unmunogen to produce a monoclonal antibody (4f!$ 
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c, for the marker sequence. The antibody func- 
q as a reagent for a number of different immuno- 
ical procedures including "Westerns," "dot blots, " 
Junpprecipi cations, and affinity purification when 
led to a solid support. Furthermore, the discovery 
e 4£1 1 antibody would release its antigen when 
-'m was removed from the medium led to the devel- 
n| of a mild purification procedure for fusion pro- 



tein elution from the affinity columns. Next, treatment of 
the fusion proteins with enterokinase demonstrated that 
the enzyme was capable of removing the marker segment 
efneiendy, with little or no observable degradation of the 
desired protein product. Finally, measurements of specif- 
ic activity demonstrated that, for all proteins tested, no 
appreciable loss of activity was caused by the presence of 
the marker segment on the N-termini of the recombinant 
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a -fa cxoT^-^^r^lL-Z cDNA 
leader /Marker 



Kpnl HgiAI 

GTA CCT TTG GAT AAA AGA GAT TAC A AG GAC GAC GAT GAC A AG GCA CCT ACT TCA 
GA A AC CTA TTT TCT CTA ATG TTC CTG CTG CTA CTG TTC CGT GGATGA AGT 

Pro Leu Asp Lys Arg Asp Tyr Lys Asp Asp Asp-Asp Lys Ato Pro'Thr Ser 

r— (I 1 | ^ 



a-foclor leoder 



KEX 2 



Marker 



J IL-2 
Enterokinase 



$rajd d!XY8, for expression of the LL-2 fusion 
^st^ The synthetic oligonucleotides used in con- 
i&fpias'mid cxtendedfrom the Kpnl site at the left 
$ice shown below the plasmid diagram, to a blunt 
,HgiAI site near the right side, ending with the 
$p.:C-termina! lys residue of the marker peptide 
^e. IL-2 coding sequence extended from a blunt 



end before the first codon of IL-2 (Ala) to.a StuI site beyond 
the termination codon. The arrows below mcr amino "a'cid" ; 
sequence indicate the sites of cleavage by the KEX2 protease 
to remove the a-factor precursor sequence from. the primary 
translation product, and by enterokinase, to -remove the 
marker peptide from the product protein. 
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|AGCT GAC TAC AAA GAC GAT GAC GAT AAA GC 

[CTG ATG TTT CTG CTA CTG CTA TTT CGTGGG 

Ala Asp -Tyr- Lys -Asp-Asp- Asp-Asp-Lys Ala-Pro 



FGGG) 



Sspl 



GM-CSF 




mid pRL6-6-8'7 f for expression of the GM-CSF 
i .-in E. coli. Abbreviations: Lpp*, lipoprotein 
.-lactose promoter-operator region; ompA sig- 
^UjVrane protein A signal peptide sequence; GM- 



loci** 



CSF, human granulocyte -macrophage colony stimulating fac- 
tor; Ipp"*, lipoprotein terminator; /ac/", lactose repressor; ori, 
origin of replication. 
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products. 

Secretion of fusion proteins from yeast. Figure 1 shows 
the plasmid pIXY8, used for production of the IL-2 
fusion protein in yeast. Similar constructs with the ADH2 
promoter replacing the a-factor promoter were used to 
express granulocyte colony stimulating factor (G-CSF) 18 , 
imerleukin 3 (IL-3) 19 , imerleukin 4 (IL-4) 20 and granulo- 
cyte-macrophage colony stimulating factor (GM-CSF) 21 
fusion proteins. All of these proteins were secreted into 
culture media by yeast. Each had the expected molecular 
weight for the correctly processed form (i.e. with leader 
peptide removed) and yielded the expected sequence of 
the marker peptide on N-terminaJ amino acid sequence 
analysis. The IL-2 fusion protein was purified by HPLC 
for use as an immunogen, while the other fusion proteins 
were purified by the 4E1 1 antibody immunoafrinity chro- 
matography procedure (below). For comparison, essen- 
tially identical vectors were prepared that expressed each 
protein without the marker peptide. These products were 
recovered from the yeast culture media and purified to 
homogeneity by conventional techniques including HPLC 
and ion exchange chromatography 22 . 

Expression of" GM-CSF fusion protein in E. coli. The 
construction of/the plasmid pRL6-6-87 for expression and 
secretion of. the -GM-CSF fusion protein in E, coli is 
oudined ih'Figure; 2. -This plasmid allows the secretion of 
the marker peptide GM-CSF fusion protein by means of 
the signal -peptide from the outer membrane protein. 
;QrnpA\ ;.,t=Keijji^duct!.pbtaLned. .from E. coli cultures had 
the' expected molecular weight for the marker peptide 
GM-CS F." f u siqn and . yield ed an N -termin al amino acid 
'sequence --corresponding - to the 'marker peptide sequence. 

The marker-specific antibody. The iso type of the 4E1 1 
antibody is lgG 2B, It was found to be reactive with 
proteins faring the marker peptide sequence in a variety 
of procedures, including ELlSAs, dot blots, Western blots, 
immunoprecipitation and affinity chromatography, as de- 
scribed beloyv. The antibody was found to react with all of 
the marker peptide fusion proteins that we have pro- 



duced. The antibody exhibits no reactiv^ 
marker products, or any component : p^' 
extracts, or in yeast culture medium. • 

Purification. Figure S shows the (rj 
affinity purification chromatograms oii^' 
made with the 4E1 1 antibody. In Fi^U^ 
supernatant obtained by fermenting j^ea 
GM-CSF expression vector was passed bVe 2 " 
purify the fusion protein that had been-Sy^ 
medium. Medium components were reiriq^ 
with PBS containing 0.5 mM CaCl 2 , wSjj$£t 
remained bound to the antibody. Subsequent 
PBS containing EDTA dissociated the:ma| 
antibody complex and released the GM-C[§ 
tein as a purified product. The multiple md)' 
species eluting from the column are typicj^ 
teins secreted from yeast and result fromT 
glycosylation by yeast cells. All bands Wer<£, 
GM-CSF, based on Western analyses using^, 
anti. GM-CSF monoclonal antibody as -jdjMj!* 
agenti. . ;;v^? 

Figure 3B shows the results of affinity crit^ 
of an extract of E.' coli. cells that had bceivy 
with pRL6-6-87 in order to produce the G 
protein. Chromatography was carried otitj ri 
yeast GM-CSF fusion, except that 1 mM.tfEif 
during washing and 0.1 M glycine HC1 pH,;S|^ 
..used to" elute.. the. product. The GM-CSE^fitst 
el u ted as a single.molecular weight speries^ 
does riot glycosylate proteins. The prcniuctj#| 
pure after this single chromatographic stepf^i) 
;;Th^ binding of the, . 4£J 1 anti bod yf'&$£ 
peptide is dependent on the presence lof\W 
property has been reported for a few othef.$ ( 
'the past? 9 . We observed that if insuffici^jiri' 
calcium^were present in washing buffer, tht^A 
proteins - would leak from the affinity x<£{[ 
though they had bound quantitatively whert;^ 
hatant dr.E. coli extract was passing over tije^ t 




nCWt 3 Affinity purification of recombinant fusion proteins. 
Panel A: Silver-stained polyacrylamide gel of Yeast GM-CSF 
fusion protein purification. Lanes are: 1: molecular weight 
standarda; 2: yeast supernatant; 3: flow through material; 4- 
6: sequential PBS/0.5 mM CaCl 2 washes; 7—15: "sequential 
PBS/2.0 mM EDTA elutions; 16: molecular weight standards. 
Panel B: Silver-stained gel of E. coli GM-CSF fusion protein 



purification. Lanes are: 1: molecular weight standards; 2) 
coli supernatant; 3: flow through material; 4-6: scqucn 
PBS/0.5 mM CaCl 2 washes; 7-11: sequential 0.1 M Gly-*, 
pH 3 elutions; 12: molecular weight standards. Numbers; 
right indicate M r values for the standard proteins (in kilo 
tons). The 94 K standard was omitted from the gel shown 
panel A. 



BEST AVAILABLE COPY 



the effect of various concentrations of CaCi^ in 
trig buffer, we determined that concentrations of 
Sim above 0.3 mM were necessary to retain the fusion 
Ceins on the column. Given this calcium dependence, it 
"'*o'und that rapid elution of fusion proteins could be 
'yed by using EDTA in the elution buffer. It was also 
tile to elute proteins simply by using a calcium free 
a buffer after 1 the columns had been washed with 
[jntaining calcium. However, under these conditions 
ted fusion protein tended to spread through more 
' than when the elution buffer contained EDTA, 
jrokinase treatment. Figure 4 shows the results of 
ase treatment of the IL-2 fusion protein. In- 
^'amounts of en tero kinase were added to identical 
^of the IL-2 fusion protein, then the samples were 
^d for 16 hrat S7 6 . As the concentration of enzyme 
"jgased, a component appeared at the molecular 
§£ ; ihe authentic IL-2, At the highest concentra- 
;\ enzyme, the conversion of fusion protein to 
"4t! f protein was complete, and an approximately 
'^ajenpunt of authentic product had been formed. 
i|w6t analyses using 4E1 1 and anti-IL-2 monoclo- 
^clies confirmed the identities of the products 
^jfljt silver gel. Only the higher molecular weight 
'^'reactive with 4Ell t but as expected both the 
IeI jlower molecular weight species were reactive 
£hti-IL-2 antibody. Amino terminal sequence 
£the resulting cleavage products indicated that 
ftion protein was cleaved after the second lysine 
^er peptide yielding the expected sequence for 
ue IL-2 N-terminus. As can be seen in Figure 4, 
is complete, and in this case results in only 
Vits of detectable lower molecular weight by- 
imed by enterokinase cleavage within the IL-2 
essentially the same cleavage pattern was ob- 
I'Jihe other fusions as well. In no case were 
products present in quantities greater than 
^grote^n (determined- by sequence analysis), 
pn^of biological activity. The marker peptide 
Iteins expressed in yeast and £. coli were all 
'fjatiive despite the presence of the marker 
>5fjseen in Table 1. In all cases the levels of 
i^yity obtained with the fusion proteins were 
^Vwild-type recombinant proteins expressed 
( irker sequence. In the case of GM-CSF the 
jjty values in Table 1 were obtained before 
^inoval of the marker sequence by enteroki- 
-fearly identical specific activities obtained indi- 
"dy.yield of cleaved product is probably near 



t er ; peptide fusion system described in this 
&es a unique and widely useful technique 
idintification and purification. In addition, 
ojiis have shown that the marker peptide is 
jvith heterologous expression systems: one, 
^Zi:pmpA signal and the yeast pre- pro alpha 
^sequences, when fused to the marker pep- 
y.^ly. processed by their respective proteases 
'ja.on of marker peptide-protein fusions with 
j|^minus. And second, because many inves- 
j&pported problems in the N -terminal proc- 
^ially expressed mammalian proteins 2 ,*-* 6 , 
£*jiie marker peptide to protect the N- 
desired procfuct may be another impor- 
?^tius system. 
. t fty to produce authentic N -termini upon 
|&j£atment is an improvement over a number 
protein approaches. Those that require 



chemical cleavages using Asp-Pro 12 or Asn-Gly is directed 
reagents, for example, must necessarily leave a proline or 
glycine at the N-terminus of the product. Although we 
have not yet tested the ability of enterokinase to cleave the 
marker sequence from N-terraini containing all of the 20 
possible amino acids, we have found that it is capable of 
cleaving products with N-terminal Glu, Ala, Thr, Leu and 
He residues. This suggests that this procedure will be 
useful for a wide range of N-termini including charged 
and uncharged, hydrophobic and hydrophilic residues. 

There are several requirements that should be met by 
an efficient detection and purification system based upon 
fusion polypeptide expression; First, the added marker 
segment should riot interfere with the native folding of 
proteins to which it is attached. Second, the marker 
peptide sequence should be intrinsically water soluble and 
should retain a high degree of exposure in the aqueous- 
environment of the protein, so that it can readily interact 
with the affinity purification substrate. Third, it should be 
useful in an affinity purification step that requires only 
very mild media, and be elutable witfi a non-denaturing 
and inexpensive eluant. Finally, the marker peptide 
should be easy to remove and the product protein should 
not have any amino acids added or deleted once the 
marker peptide has been removed. The Flag" 1 peptide 
fusion system was designed to possess all of these proper- 
ties, and our data with several recombinant proteins 
suggest that it may prove to be a universal purification 
system for proteins expressed in heterologous organisms. 

Several factors were considered in choosing the specific 
sequence of the marker peptide moiety. We chose to limit 
the marker peptide sequence to. only eight amino acids- 
because it can easily be encoded in a single synthetic 
oligonucleotide, and because the longest. trypsinogen pro- 
sequences are of this length.; We therefore could be 
reasonably sure that the trypsin-activating enzyme, en- 
terokinase, would work efficiently, to. release the peptide. 
Additionally, because antibopLies^recjuire.up to six dr,seven:. 
amino acids for avid binding interactions, we reasoned 
that eight amino acids should be the minimum sequence 
capable of strong binding to an* antibody while allowing 
one or more of the last amino acids on the C-termihal c'rid 
to act as a spacer to separate the antibody binding portion 
from interference with the bulk of the protein. Finally, the 
five C-termina! amino acids of the marker sequence 
represent the minimal enterokinase specificity site, 
AspAspAspAspLys. 

The choice of Lys at position ■ three of the marker 



TAHI 1 Expression level and specific activity of marker fusion 
proteins. 




'Determined by dot blot assay usinp 4E1 1 antibody to detect 
marker peptide containing material. 

"Proteins in this column (except £. coli GM-CSF) were pro- 
duced without the marker segment and purified by conven- 
tional means. 
c Not determined. 

d ln this case, specific activities were determined on the same 
sample before and after enterokinase treatment to remove the 
marker segment. 
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sequence causes the marker peptide to contain the hexa- 
pepiide sequence, LysAspAspAspAspLys, that has a maxi- 
mum value on the hydrophiliciry scale of Hopp and 
Woods 17 . Such maximally hydrophiiic sequences have 
been proven to express strong antigenicity and are corre- 
spondingly likely to adopt a highly exposed conformation 
in the three dimensional folding of a protein". As can be. 
seen in Figure 5, it is impossible for any other region of a 
protein to have a higher hydrophUicity value than this 
maximally hydrophiiic sequence, so the marker segment is 
virtually guaranteed to be exposed at the surface of any 
fusion Drotein. Therefore it can always be expected to be 
available for binding to antibody. Perhaps most impor- 
tantly, the strong predilection for extern aiization should 
guarantee that the marker segment will not interfere in 
the adoption of a native conformation by the remainder 
of the protein. 

In addition to the hydrophiiic effects of Lys at position 
three, several other considerations influenced the choice 
of amino acids at the N-terminus of the marker peptide. 
Aromatic amino acids have been recognized as major 
factors in antigen-antibody interactions* 9 so a tyrosine was 
placed at position 2, flanked by charged amino acids. 
Recent evidence suggests that aromatic residues that are 
flanked by charged sequences* are, more likely to be 
involved in antigenic sites than arc other aromatic resi- 
dues in less, polar environments 26 . The decision to place 
an Asp" residue at the N-terminus was' made in part 
because the negative charge on Lhe Asp should aid in 
exposing the Tyr to antibody, as mentioned above, and in 
~P a ri^tecause, with the inclusion of the Asp at position 1, a 
total of eight charges are to be fo unci on the marker 
, perjticl.e moiety, including the N-terminal amino group. 
*This : prep6ndcrance of charged residues was expected to 
mkke ; it likely that antibody binding would be heavily 
dependent on charge-charge interactions, and therefore 
might be highly susceptible to elution with commonly 
used salt solutions such as 2 M MgCI 2 or 1 M NaCl 9 . In the 
end, the serendipitous discovery that Ca +2 was involved in 
the charge-dependent binding of the marker sequence to 
the 4EU antibody made even these mild salt treatments 
unnecessary. 

We have seen that fusion, proteins retain the appropri- 
ate specific activity even With, the marker segment still 
attached, and that this '"'activity can be maintained after 
enterokinase treatment to remove the marker sequence. 
-Comparisons of several of these fusion protein products 
with their natural counterparts (Table 1) demonstrated 
that the presence of the marker did not decrease the 
specific activity of the fusion proteins relative to the same 
proteins with no extraneous amino acids added. We have 
recently begun using a larger version of the 4E1 1 column 
to prepare proteins in milligram quantities. This level of 
scale-up required no special procedures or equipment, 
and can still be done as a bench top experiment. Further 
scale up for production of gTam or kilogram quantities is 
contemplated, and will bti limited only by availability of 
antibody and enterokinase. One advantage of the 4E11 
antibody is that it can be purified on an affinity column 
comprised of chemically synthesized marker peptide at- 
tached to a solid support arid eluted with EDTA. 

One area that remains potentially problematic is the 
provision of adequate enterokinase for this process, both 
in terms of quantity and quality. We sampled commercial 
sources of enterokinase, but found that Our fusion pro- 
teins were digested into small fragments, presumably by 
contaminating chymotrypsin, trypsin and elastase that are 
likely to be present in these partially purified prepara- 
tions. Our own crude bovine intesunal preparations also 
caused substantial unwanted hydrolysis, until we used the 




nCSKI 4 Enterokinase digestions. The IL-2 fu'^o* 
incubated with increasing amounts of bovine/' 
and the digestion was analyzed by silver-stainingv 
are: 1: purified IL-2 fusion protein (200 ng^n 
fusion protein + 2 ng enterokinase; 3: fusion- p" 
enterokinase; 4: fusion protein + 20 ng entero 




, 

SEQUENCE POSITION Vi^> 
F1CWDJ 5 HydrophUicity plot of the IL-2 fusion 'jSot' 
profile was generated using the updated HYDRQjJV 
of Hopp 10 . The scale is oriented so that hydrophiiic 
hydrophobic U at bouom. Valley regions are ex/'"'^ 
buried portions of the polypeptide, whereas pealiaw 
ed to be exposed at the surface of the protein. The!'^ 
peak near the N-terminus results from the extrcmcf 1 ' 
philic hexapeptide, LysAspAspAspAspLys, con tarn e r 
the marker peptide sequence. 

: 

protocol of Liepnieks and Light* 0 for removrq 
other proteases. The fact that a minor amoun£ : o#; 
ed cleavage is seen in some cases when the marker^ 
is removed (Fig. 4) suggests that traces of conta^ 
proteolytic activity niay still be present in our? 
preparations of enterokinase. An ideal solution vV ; 
to clone enterokinase and express it in a 
organism. This would provide starting materr 
much lower levels of contaminating proteases, arid- 
higher levels of enterokinase. Furthermore, 



wr 



enterokinase gene in hand, it might be possible tS. 



neer a smaller form of the molecule, lacking the : K$ 
phobic portion that binds it to the membranes .6^ 
intesunal villi. This would simphfy purification an" 
creaste the mass of enterokinase needed for cleavage o* 
marker segment- y[ 
Despite the need for further development of enter 
nase, this system for fusion protein detection and puri 
tion already represents a useful technique. It offers 
possibility of using a single procedure for the purifica 
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ultiple fusion proteins. Although it is also possible to 
fy fusion proteins from yeasi medium or E. coh cells 
onvenuonal means such an ion exchange or reverse 
e chromatography, these procedures require new 
bd development for each new protein, whereas with 
'";fcr fusions, the same process is applicable to all 
^iiis. Finally, because the marker segment does not 
%r to have decreased the biological activity of any of 
Spteins that it has been placed on, it may not always 
qessary to remove the marker segment in order to 
^an active product. In such cases, this useful "han- 
Jp be retained on the molecule, enabling mvcsuga- 
readily detect and manipulate their recombinant 
n products. 

UMENTAL PROTOCOL 
;pUsmid construction. The yeast vector used for protein 
ton has been described previously 31 . This vector contains 
ops from pBR322 thai allow selection (Amp') and replica- 
coli t as well as the yeast Trpl gene and 2jt origin of 
4pri' for selection and autonomous replication in a trpl 
airi: Expression of Foreign genes is under control of the 
■^promoter* or the ADH2 promoter** and secretion is 
l£:Oy the a-factor leader peptide. To generate the IL-2 
*% vector pIXYB, the. mature codinz region of IL-2" was 
,$rame to the marker peptide and the a-factor leader by 
" ^ synthetic oligonucleotide linker encoding the five C- 
^rnino acids of the a-factor leader and the eight amino 
e'rharkcr peptide (Fig. 1). The vectors thai directed the 
'i-'qf the other products were generated by two modifi- 
$iXY8. First, the a-factor promoter was removed by 
tthe plasmid with EcoRI and Pstl, then inserting the 
jr&moter using a synthetic oligonucleotide linker. Sec- 
■njirker and appropriate protein coding sequences were 
place of the IL-2 sequence (Fig. 1) and linked with a 
iigonucleotide thai extended to the Hpal site. 
: ,«jf yeast strains. 5. cerevisiae strain _XV218/(a/a-trp-l) 
ny $p either selective medium [YNB "trp, consisting of 
^'Nitrogen Base (Difco), 0;5% Casamino acids, 2% 
$Lg/mJ adenine and 20 u,g/ml uraciI]or rich medium 
pJ^&g o f 1% Yeast Extract, 2% peptone and 1% 
L&piernen ted with 80' jig/ral ^adenine >. and ■ 80 * jpuj/ mj 
^^transformations were done by selecting for Trp 
"jfiji"-. Cultures were 'grown . for .biological assay by 
&0-50 mi of rich mediulrcj-with Irje, appropriate strain ; 
jg'ithc cultures at iSpAC-: to stationary .phase. Cells were . 
"eci by centrifugation and the medium was filtered 
v'&Sm- cellulose acetate filter. Sterile supcrnatcs were 
^■-'Larger scale fermentations were done in a 10 liter 
wjck Micro ferm fermehtor. Cells were removed from 
v 'i using a MUlipore Pellicon filtration system. 
JmOU ot the E. coli vector. Plasmid plN-IU-OmpA^ is 
Jfpression vector regulated by the uindem lpp p (lipopro- 
' v ter)/£oc po (lactose-promoter-operator) that contains a 
* IJ *A leader sequence for protein secretion 8 *. Construe- 
C;';GM-CSF expression vector was accomplished by 
^digestion at the unique BamHI site of pIN-111- 
■"tlbwed by iu conversion to blunt ends by treatment 
"^transcriptase (Boehringer-Mannheim). The vector 
,.^ntiy restricted with Hindlll, and used in a three- 
'Swith a synthetic oligonucleotide encoding the marker 
uencc and cDNA encoding GM-CSF to produce 
{as outlined in Figure 2. 

"of/.£. coli. plasmid pRL6-6-87 was introduced by 
Juori into E . coli strain JM 107, (AZac, pro, thi, strA, cndA t 
traD36,proAB + , lacF-ZAhilb) which was ^rown 
^irninimal medium containing 1% (w/v) methionine 
: '^DIFCO) and ampiciilin (50 u.g/ml) to an OD«>o of 
'^'were induced following addition of isopropyl^-D- 
;ranoside (1PTC) and cyclic 3 '-5' adenosine mono- 
|&MP) to 2mM and 4mM, respectively, and allowed 
*.';the GM-CSF fusion protein for 2-4 hr. Cells were 
'^mrifugation, and pellets cither stored at -70°C or 
"li»ed for extraction and purification of marker 
tju;' £. coli pellet were extracted by the following 
"e.picllet from 500 ml of culture was suspended in 50 
jM 'NaCl, 50 mM NaH 2 PO<, pHB.4 ( to which 1 mM 
^isulfonylnuoridc had been added immediately prior 
"*tef. freezing (-70"C) and thawing three times to lyBe 
tlL the sample was incubated at 37°C for 30 min to 



complete lysis and to extract the protein product, i he viscous 
extract was treated by dounce homogenization to achieve^ a 
uniform solution, then centrifuged at 25,000 g for 45 min at 4°C 
The supernatant was adiusted to 0.5 mM CaCl 3 . recentnfuged if 
necessary to remove any" resulting precipitate, then applied to the 
affinity column. 

Preparation of immunogens. Palmitic acid conjugated pep- 
tides were produced bv solid phase chemical synthesis as de- 
scribed previously". The antigenic marker peptide had the 
sequence AspTvrLysAspAspAspAspLysGiyProLysLysGly to 
which palmitoyl moieties had been attached on the epsiion amino 
groups of the two C-terminal lysines. It is referred to as CDP- 
marker (C-terminal dipalmitoyl marker peptide). A second pal- 
mitovi peptide, NDP-GM1 (N-tcrminal dipalmitoyl GM-CSF pep- 
tide 1) was used as a non-specific binding control. It had the 
structure LysGlyGlyGluSerPheLysGluAsnLeuLysAspPheLeu- 
ValGly, and also possessed two palmitoyl moieties, in this case 
attached to the two amino groups of the N-terxninal lysine 
residue. For purification of the IL-2 marker peptide fusion 
protein, supernatahts of yeast expressing the IL-2 fusion protein 
were applied to a reverse phase HPLC column. The IL-2 fusion 
product was eluted from the column using a gradient of ace ion i- 
rrile, as described previously* 8 . 

Immunization. BALB/c female mice were purchased from the 
Jackson Laboratories (Bar Harbor, ME) and maintained in our 
animal facility. Mice were immunized subcutaneously with 250 
u,g- of IL-2 fusion protein emulsified in Freund's complete 
adjuvant, followed with 125 u,g of the same protein emulsified in 
Freund's incomplete adjuvant four weeks later. Two weeks after 
the second inoculation, a serum antibody liter to IL-2 and the 
marker peptide was measured by "dot-blot". The animal was then 
challenged with 10 >tg of protein intravenously four days prior to 

fusion. . - 

Hvbridoma derivation. Four days after the intravenous boost, 
the animals were sacrificed! their spleens removed, and a single 
cell suspension prepared'.- The splenocytes were fused to the 
HAT sensitive myeloma cell, NS-1. The resulting hvbndomas 
were then assayed for the production of antibodies to the marker 
peptide seven to ten days- later by ELISA (see below). Qne 
hybridoma antibody consistently produced a positive reaction 
specifically with the marker peptide moiety. This cell'line, desig- 
nated 4EU, was then cloned by limiting dilution, tsotyped, and 
further characterized. - " . 

... sFI TSa.. Various peptide solutions (CDF-marker, or NDF- , 
GM1) were applied to HA plates (Millipore, Bedford, MA) at-a- 
concentration of 40 ng per well and allowed to incubate for SO 
niin aV room temperature; nonspecific protein binding sites were 
Blocked- by an incubation with S% bovine serum albumin in Tns 
buffered saline, pH 7.0 (TBS A) for 1 hr at room lemperarure. 
Hybridoma supernatants were added and the plates incubated 
for 1 hr. Following this incubation, the plates were washed with 
PBS and an alkaline phosphatase labeled goat anti-mouse anti- 
body (Sigma Chemical, St. Louis, MO) was added. Following a 1 
hr incubation, the plates were washed several time* with PBS and 
a colorimetric indicating reagent was added (substrate tablets, 
Sigma -Chemical). Contents of each HA plate were then trans- 
ferred to a polystyrene 96 well plate- (Unbro/Titertek, Flow 
Laboratories, McLean, VA) and. the absorbance at 405 nm 
determined on a Titerscan (Flow Laboratories). 

Production and purification of 4E11 andbodv. Pristane- 
primcd BALB/c mice were injected IP with 1 * 10 hybridoma 
cells. Ten to twenty days later, the ascitic fluid was recovered, 
centrifuged at 1000 x g for 30 minutes at 4°C, passed through 
cotton gauze, and the supernatant stored at -20°C until needed. 
The monoclonal antibody 4E11 was purified from ascites fluid 
using MAPS II Protein A Affigel (Bio-Rad, Richmond, CA) 
affinity chromatography. The purified antibody was found to be 
homogeneous by SDS-PAGE analysis. 

4E11 column preparation. Purified 4E1 1 immunoglobulin was 
concentrated by ultrafiltration. After dialysis against 0- 1M He P c5 
buffer, pH 7.5 at 4°C the antibody was coupled to Affigel- 10 (Bio- 
Rad) in accordance with the manufacturer's instructions. A 
typical antibody-coupled gel contained, from 1.5 to 4.5 me 
antibody/ml of gel. Columns of 4E1 1 coupled gel of 1.5 ml bed 
volume were prepared in polypropylene columns (Bio-Rad) and 
washed with 15 ml PBS, 15 ml 0.1M glycine HCl. pH 3.0, and 
stored at 4*C in PBS/0.02% sodium axide. 

4E11 column chromatography. Yeast culture filtrates were 
brought to physiolocrical levels of salt and pH by adding 10X PBS 
and made 0 5 mM in CaCl 2 by adding 1 M CaCt 2 , and then loaded 
onto the 1.5 ml column of 4E11 coupled Affigel 10 under grav]ty 
flow. £. coli extracts did not require any further additions because 
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the extraction medium contained physiological levels of salt and 
pH as well as 0.5 mM CaCl 2 . Up to 100 ml of filtrate were passed 
over the column, depending on the level of expression of the 
recombinant protein. After loading, the column was washed with 
three to five alkjuots of 3 ml of PBS containing 0.5 mM CaCl 2 . 
Elution waa carried out with PBS lacking CaCl 2 and containing 
2.0 mM Na 2 EDTA or with 0.IM glycine HC1 pH 3.0. Each 
elution fraction was 1 ml. Yields of purified "proteins were 
determined by amino acid analysis, and were typically 15 — 40% of 
the theoretical maximum assuming a 2: 1 antigen to antibody 
binding ratio. 

Entero kinase treatment. En lero kinase was purified from bo- 
vine intestine by the procedure of Liepnieks and Light 30 . Samples 
were also provided by A. Light of Purdue University. For 
enterokinase treatment, fusion proteins elutcd from the antibody 
column were made 10 mM in Tris-HCI (pH 8) and adjusted to 
pH 8.0 by addition of IN NaOH. For certain samples, the 
reaction mixture was made 40 mM in ociyl-p-D-glucoside. Follow- 
ing the addition of an appropriate amount of bovine enteroki- 
nase (1-10% by weight; typically 0,2-2% by molarity), the reac- 
tion mixture was incubated for J 6 hours at 37°C. Enterokinase 
dilutions were made from a 1 mg/ml stock solution of enzyme in 
10 mM Tris-HCI, pH 8 kept at -70°C. 

Bioaasays. The activity at IL-2 was measured using the murine 
IL-2 dependent T-cell line CTLL-2 90 . The activity of GM-CSF 
was measured in a human bone marrow proliferation assay" and 
the activity of IL-3 was measured by FDC-P2 cell proliferation". 
IL-4 and G-CSF were assayed as described 30 -* 5 . Specific activities 
were derived by measuring the biological activities of purified 
samples of each protein, after quantifying by amino acid analysis. 
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Characterization of the Phe-81 and Val-82 human 
fibroblast collagenase catalytic domain purified from 
Escherichia coli. 

^HBBg MR, Condon B, Margosiak SA, Kan CC. 

Agouron Pharmaceuticals, Inc., San Diego, California 92121, USA. 

Soluble recombinant human fibroblast collagenase catalytic domain was 
highly expressed and purified from Escherichia coli. The expression 
construct utilized the T 7 gene 10 promoter for transcription of a 
two-cistron messenger RNA which encoded the ubiquitin-collagenase 
catalytic domain fusion protein as the second cistron. The ubiquitin 
domain was attached to the collagenase catalytic domain with the linker 
sequences Gly-Gly-Thr-Gly-Asp-Val-Ala-Gln (wild type) or 
Gly-Gly-Thr-Gly-Asp-Val-Gly-His (mutant) which served as cleavage 
sites for in vitro activation. The last four residues of the linker were 
included based on the crystal structure of human prostromelysin-1 
catalytic domain. Soluble fusion proteins purified from E. coli retained 
the proteolytic activity of the collagenase catalytic domain. The 
collagenase catalytic domain was released by either autoproteolytic or 
stromelysin-1 -catalyzed cleavage, purified to homogeneity, and separately 
possess Phe-81, Val-82, or Leu-83 as the amino-terminal residue. Very 
similar kcat/Km values were determined for the Phe-81 and Val-82 forms 
using continuous fluorogenic and chromogenic peptide cleavage assays. 
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High-level production and purification of biologically 
active proteins from bacterial and mammalian cells using 
the tandem pGFLEX expression system. 

s ^lanohara n HT, Gallo J, Gulick AM, Fahl WE. 

McArdle Laboratory for Cancer Research, University of Wisconsin, 
Madison 53706, USA. 

Because of the complexities involved in the regulation of gene expression 
in Escherichia coli and mammalian cells, it is considered general practice 
to use different vectors for heterologous expression of recombinant 
proteins in these host systems. However, we have developed and report a 
shuttle vector system, pGFLEX, that provides high-level expression of 
recombinant glutathione S-transferase (GST) fusion proteins in E. coli 
and mammalian cells. pGFLEX contains the cytomegaloma virus (CMV) 
immediate-early promoter in tandem with the E. coli lacZpo system. The 
sequences involved in gene expression have been appropriately modified 
to enable high-level production of fusion proteins in either cell type. The 
pGFLEX expression system allows production of target proteins fused to 
either the N or C terminus of the GST pi protein and provides rapid 
purification of target proteins as either GST fusions or native proteins 
after cleavage with thrombin. The utility of this vector in identifying and 
purifying a component of a multi -protein complex is demonstrated with 
cyclin A. The pGFLEX expression system provides a singular and widely 
applicable tool for laboratory or industrial production of biologically 
active recombinant proteins in E. coli and mammalian cells. 
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